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We examine the cluster- size dependence of the cellular dynamical mean-field theory (CDMFT) applied to 
the two-dimensional Hubbard model. Employing the continuous-time quantum Monte Carlo method as the 
solver for the effective cluster model, we obtain CDMFT solutions for 4-, 8-, 12-, and 16-site clusters at a 
low temperature. Comparing various periodization schemes, which are used to construct the infinite-lattice 
quantities from the cluster results, we find that the cumulant periodization yields the fastest convergence for the 
hole-doped Mott insulator where the most severe size dependence is expected. We also find that the convergence 
is much faster around (0,0) and (f , f ) than around (7r,0) and (7r,7r). The cumulant-periodized self-energy 
seems to be close to its thermodynamic limit already for a 16-site cluster in the range of parameters studied. 
The 4-site results remarkably agree well with the 16-site results, indicating that the previous studies based on 
the 4-site cluster capture the essence of the physics of doped Mott insulators. 



I. INTRODUCTION 

A range of anomalies observed in the normal state of high- 
Tc cuprates indicates a momentum- space differentiation of 
the electronic structure. For instance, the pseudogap^ and 
Fermi arc^'^ observed by the angle-resolved photoemission 
spectroscopy suggest a gap in the single-particle excitation 
spectra around antinodal points [i.e., (tt, 0) and its symmet- 
rically related points in the Brillouin zone] while there is 
a metallic spectrum around nodal points [i.e., (f , f ) and 
its symmetrically related points]. Since undoped cuprates 
are considered Mott insulators^, these findings have directed 
the attention to doped Mott insulators where strong elec- 
tronic correlations may cause unprecedented metallic states. 
The simplest play-ground model which describes the doped 
Mott insulators is the Hubbard model in two dimensions. 
In fact, various numerical calculations for the model sug- 
gest a momentum- space differentiation of electronic proper- 
ties at small doping,^"^^ similarly to the experimental results 
on cuprates. Nevertheless, many fundamental questions re- 
main unresolved on how the metal-Mott insulator transition 
takes place in two dimensions: How does the Fermi surface 
evolve while approaching the Mott insulator? Is there a Fermi 
arc or Fermi pocket in the underdoped region? Is the length 
of this Fermi arc/pocket going to zero while approaching the 
Mott insulator? Or/and is the weight of the quasiparticle exci- 
tations fading away at the Mott transition? Is there a quantum 
critical point and/or non-Fermi liquid phase in between the 
Fermi liquid and the Mott insulator? Is the Luttinger sum rule 
fulfilled in the whole doping range? 

To address these issues, we need a method that can describe 
sufficiently well the momentum dependence of the electronic 
structure. We also need to treat on the same footing the low 
and high energy scales, since the electronic structure is re- 
constructed on a wide energy range in Mott-related phenom- 
ena. Schemes which have been developed to fulfill these re- 
quirements are for instance the cluster extensions of the dy- 
namical mean-field theory (DMFT)^^, such as the cellular 
DMFT (CDMFT) and the dynamical cluster approximation 
(DCA).^^ Both theories map the Hubbard model onto an effec- 



tive model consisting of an interacting small cluster and non- 
interacting infinite bath. CDMFT defines the effective model 
in real space while the DCA defines the effective model in 
momentum space (see Fig. 1 for an illustration). In both theo- 
ries, the resolution in momentum space is limited by the clus- 
ter size, so that a high momentum resolution requires a large 
cluster, intractable with present computational resources. 

The advantage of DCA is that it keeps the translational 
symmetry of the original lattice. This allows one to define 
cluster momenta by partitioning the Brillouin zone into sev- 
eral patches [Fig. 1(b)]. The momentum dependence of the 
self-energy within each patch is neglected. Then, the coarse- 
grained Green's function, which is employed in the self- 
consistency loop, is defined by averaging over all momenta 
within each patch (e.g., including momentum on the Fermi 
surface and far away from it). Hence, in the parameter re- 
gion where the momentum- space differentiation is crucial, the 
analysis with a small cluster may lose important information. 
For example, a recent systematic study on the cluster-size de- 
pendence in the DCA^^ has shown that the nodal- antinodal 
momentum- space differentiation is not clearly seen with the 
4-site cluster patched as in the left panel in Fig. 1(b), while it 
is seen with another choice of the momentum patch (and with 
larger clusters). 

On the other hand, CDMFT is performed by defining a 
real- space finite- size cluster. It explicitly takes into account 
short-range correlations within the cluster. This scheme does 
not need the average over momentum, but instead, it breaks 
the translational invariance, which has to be restored only at 
the end by deriving lattice quantities from the cluster ones 
through a periodization procedure. This latter problem has 
been analyzed by some of the authors^^^^ using the 2x2- 
cluster CDMFT. The results suggest that a normal Fermi liq- 
uid, realized at high doping, evolves by reducing doping to- 
ward a Mott insulator through (at least) two quantum phase 
transitions to non-Fermi liquid phases by topological changes 
of the Fermi surface and the appearance of a surface of zeros 
of Green's function (i.e., poles of the self-energy). The non- 
Fermi liquid phases have hole-pocket Fermi surfaces at low 
doping and show arclike spectra at finite temperatures. 
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In order to check how robust early CDMFT results on the 
2x2 and 2x1 clusters^"'^'^^^^'^^ are against the increase of 
cluster size A^c, larger cluster studies have been highly de- 
sired. A systematic study by increasing cluster size helps also 
to identify the best periodization scheme which could reach 
the thermodynamic limit in the fastest way.^^ This would al- 
low us to understand to what extent small cluster calculations 
(still accessible by present computational limits) capture the 
right physical results, once implemented with the most suit- 
able periodization scheme. 

In this paper, we extend CDMFT up to 16-site cluster, em- 
ploying the continuous-time quantum Monte Carlo method 
(CTQMC)^^ as the solver for the effective models. By com- 
paring the solutions for the 4-, 8-, 12-, and 16-site clusters il- 
lustrated in Fig. 1(a), we systematically study the cluster-size 
dependence of the single-particle Green's function and the 
self-energy obtained through various periodization schemes. 
We find that the cumulant periodization^ gives the fastest con- 
vergence against the cluster size for slightly doped Mott insu- 
lators, where the size dependence is severest due to the strong 
momentum dependence of the self-energy. We also find that 
the convergence is strongly momentum-dependent: It is faster 
around (0, 0) and (f , f ), where the results for 16 sites seem 
nearly converged, than around (tt, 0) and (tt, tt). Remarkably 
the 2x2 cluster results are found to agree well with the 4x3 
and 4x4 cluster results. In particular, the location of the Fermi 
arc and the zero surface of Green's function at the Fermi level 
does not depend significantly on the cluster size. This sug- 
gests that previous CDMFT results obtained by using a 4- 
site cluster^'^'^^"^^ capture an essential physics of the slightly 
doped Mott insulators. 

The paper is organized as follows. In Sec. II we briefly 
introduce the Hubbard model, CDMFT, and various peri- 
odization schemes (in Appendix A the different periodization 
schemes are discussed from a different perspective). The nu- 
merical results obtained by CDMFT are discussed in Sec. Ill: 
We first study the inhomogeneity and locality of various clus- 
ter quantities, i.e., self-energy, cumulant, and Green's func- 
tion, and then discuss the periodized quantities for various 
square clusters whose edges are parallel to the lattice vectors, 
with a focus on the low-energy electronic structure. The re- 
sults for tilted clusters of Nq = 8 and 12 are presented in 
Appendix B, where the effect of the cluster geometry is dis- 
cussed. We summarize the results in Sec. IV. 



II. MODEL AND METHOD 

We study the two-dimensional single-band Hubbard model 
on a square lattice. The Hamiltonian reads 

^ = Y1 ^(^)4aCka - ^"^rii^^uY^ rii^riii, (1) 

kcr icT i 

where c^^ {c\^a) annihilates (creates) an electron with spin a 
and momentum k = {kx^ ky), c-^ (cj^) is its Fourier compo- 
nent at site i, and riia- = cj^c-^. Here, U represents the onsite 
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FIG. 1. (Color online), (a) Cluster geometries employed in the 
present CDMFT study. For comparison, (b) shows the typical mo- 
mentum patches used in DCA for the same cluster sizes, where the 
dots denote the central momenta in each patch. 



Coulomb repulsion, /i the chemical potential, and 

e(k) = — 2t(cos kx + cos ky) — At' cos kx cos ky^ (2) 

where t {t') is the (next-)nearest-neighbor transfer integral. 
We solve the model within CDMFT, restricting the solu- 
tion to the paramagnetic state to focus on Mott physics. We 
adopt a set of parameters appropriate for hole-doped cuprates: 
= -0.2t and /7 = 8^,^^ for which the CMDFT solution is 
a Mott insulator at half filling (n = 1) for all cluster sizes 
we consider. We study three values of hole doping, 1, 3, 
and 5% and fix the temperature to T = 0.06t. Because of 
the well-known fermionic sign problem, which becomes sev- 
erer with decreasing filling and temperature, we cannot reach 
larger dopings and/or lower temperatures for our largest clus- 
ter TVc = 16. In CDMFT the infinite lattice quantum problem 
(1) is mapped onto an effective cluster model, 

^eff=- / ^rdr^^^4(r)eo:^^.(r-rOc^-.(rO 

Jo ■ 

(3) 

consisting of an TVc-site cluster embedded in a bath of non- 
interacting fermions, which is described by a dynamical Weiss 
field matrix Qo at the inverse temperature {3 = ^. 

We solve the cluster model by means of CTQMC.^^ We 
choose the interaction-expansion variant^^"^"^ which is the 
most suited approach to address large clusters. Recent 
progress^^ in the updating algorithm in the CTQMC enables 
us to reach much lower temperatures than those in previous 
studies. The CDMFT-CTQMC approach incorporates all the 
correlations within the cluster, so that it converges to the exact 
solution as Nq oo. Here we study 4-, 8-, 12-, and 16-site 
clusters with the geometries illustrated in Fig. 1(a). The in- 
creasing size of the Green's function matrix as well as the in- 
creasing negative signs of the QMC samples prevents us from 
studying even larger clusters. 



3 



The effective cluster model is subject to the CDMFT self- 
consistency condition which relates the cluster one-particle 
Green's functions Gfj{iuJn) = - J dre'"^^^ (T^Ci(r)c](0)) 
to the lattice Green's function of the original model, 



Nc 
(2^ 



G(k, iun) = {iuJn ^ IJi)i - t(k) - t{iuJn) 



(4) 



(5) 



where T,{iuOn) = ^o(^^n) ^ — G^iioOn) ^ (all being 
matrices with respect to cluster sites). Here, tij(k) = 

X;K^"'*^^^*^^^'^'"'^'^^(K+k) is the single-electron part of 
the Hamiltonian^ written in the reduced Brillouin zone (RBZ) 
of the cluster, k is the wave vector in the RBZ, K the re- 
ciprocal vector of the cluster, and the intra-cluster vector 
coordinate. 

So far CDMFT has been in most cases employed for 2x2 
(or smaller) cluster calculations. Some exceptions are a sys- 
tematic analysis with Lanczos methods for one-dimensional 
clusters up to eight sites, a QMC analysis of the cluster- 
size dependence in one and two dimensions, a study about 
the different geometries in d = 2 in Ref. 26, and 8- and 16- 
site calculations performed by some of the authors. As we 
already mentioned, CDMFT clusters break the translational 
symmetry of the lattice; they are defined with open bound- 
ary conditions. This means that, unlike in DC A, cluster mo- 
menta are no longer good quantum numbers. This is not a 
problem as long as one stays in the basis defined by the good 
quantum numbers of CDMFT, namely, the irreducible repre- 
sentations of the point group of the cluster. However, if one 
wants to compare with, say, angle-resolved photoemission ex- 
periments, k-resolved spectral functions are needed. Several 
"estimators" for lattice quantities have been proposed and in 
the recent literature there has been an intensive discussion on 
what is the best strategy to produce good k-resolved spec- 
tral functions.^' ^^'^^'^^'^^ There are two main issues: one would 
like the cluster quantity that has to be periodized (i) to be as 
localized as possible, in order to minimize the impact of the 
approximation of neglecting inter-cluster correlations and (ii) 
to be as homogeneous as possible, so that translational invari- 
ance is (almost) fulfilled. 

If criteria (i) and (ii) are both met for the cluster quantity 
Q^{i(jJn), it is useful to define the corresponding lattice quan- 
tity by the Fourier expansion. 



Q^(k,icjn) 



1 ^ 
Nc ^ ^ 



(6) 



truncated by the cluster size, since the longer-range terms 
would be negligible. Here k is defined on the entire Brillouin 
zone of the original lattice, and, as introduced above, and 
Tj are the real- space vectors specifying the cluster sites i and 
j, respectively.^^ 

Two different choices for Q have been mainly proposed 
so far: The cluster self-energy E (Ref. 6) and the cumulant 
M = [iujn + /i - S]"^ (Ref. 7). In the following, we call 



the two schemes E periodization and M periodization, re- 
spectively. It is expected that in the weak-coupling regime 
the self-energy is small and the cumulant is well localized so 
that both choices can be appropriate. On the other hand, in 
the strong-coupling regime, while the self-energy is large and 
nonlocal, the cumulant is well localized so that Q = M is 
expected to be a more appropriate choice.^' For Q = E, ac- 
cording to Eq. (6), the lattice Green's function is given by 



G^(k, iun) = [iuJn + M - e(k) - E^(k, iujn) 



(7) 



while, for Q = M, the lattice Green's function is given by 

GL(k, iujn) = [ML(k, iujn)-' - e{k)]-\ (8) 
and the self-energy reads 

E^(k, iuJn) =iUn^lJi- M^(k, iUn)~^. (9) 

We can also use Green's function in order to build the 
translational-invariant object. We call this procedure G pe- 
riodization. In this case, however, we have to modify Eq. (6). 
Indeed Eq. (6) with = G^ does not reproduce the correct 
lattice dispersion, as shown in Appendix A. A prescription for 
this was previously proposed in Ref. 28 and it has been al- 
ready used in CDMFT studies, in particular in Ref. 13. The G 
periodization formula reads 



1 



(10) 



where Gij{k^iuJn) with k = k modulo K is defined in 
Eq. (5). In this case we obtain the lattice self-energy through 

^^{k.iujn) = iuJn ^ 1^ - e{k) - G^{k,iujn)~\ (11) 

In a large-frequency expansion (see Appendix A), these 
three periodization schemes give the same lattice Green's 
function up to the second order. We expect, however, that 
the M and G schemes are closer to each other than to the E 
scheme near the Mott insulator (see again Appendix A for de- 
tails). 

A difference between the three periodization schemes be- 
comes prominent only at low energy. Hence, in the next sec- 
tion, we focus on low-energy behaviors. We first compare the 
locality and inhomogeneity of the cluster quantities Q^, which 
determine the accuracy of the periodization (6). Even though 
Eq. (10) in the G periodization is not a truncated Fourier trans- 
formation, the cluster Green's function (4) is still relevant be- 
cause it gives the average of Green's function (5) over RBZ. 
Therefore, we examine the locality and homogeneity of M^, 
E^, and G^ in the next section. 

We will show that, in the parameter region of lightly hole- 
doped cuprates, the cumulant is at the same time well lo- 
calized and fairly homogeneous over the cluster so that it 
certainly represents the best choice as an "estimator" of lat- 
tice quantities. Compared to DC A the advantage is that k- 
resolved quantities are not smeared out because the averages 
within each momentum patch of the Brillouin zone employed 
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FIG. 2. (Color online). Absolute values of local components of 
(a) the cluster cumulant (Q = M), (b) self-energy (Q = S) and 
(c) Green's function (Q = G) against the Matsubara frequency for 
Nc — 16 and n = 0.95. The index i denotes the cluster sites num- 
bered in the right-most panel of Fig. 1(a). The errorbars are below the 
symbol size, except for those at the lowest Matsubara frequency, (d) 
Comparison of the normalized deviations defined in Eq. (12), which 
shows that the cumulant expansion is the most homogeneous. 



in DCA are not taken in CDMFT. The disadvantage is that the 
periodization is not unique and it involves an artificial average 
of cluster quantities with the same lattice vector. 

In the next section we will present results for different clus- 
ter sizes and geometries. The 2x2 cluster is a bit special since, 
by symmetry, all sites are equivalent so that the quantities 
are already homogeneous over the cluster and the concept of 
"cluster momenta" is still meaningful. Another way to see this 
is that, since the cluster is entirely made of "surface sites", it 
fulfills periodic boundary conditions. For convenience's sake, 
in the following sections, we abbreviate M^, E^, and with 
M, E, and G, respectively. 
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FIG. 4. (Color online), 
neighbor components. 



Same as Fig. 2 but for the next-nearest- 



III. RESULT AND DISCUSSION 
A. Inhomogeneity of duster quantities 

We start with the inhomogeneity in the cluster quanti- 
ties. As we noted in the previous section, the inhomo- 
geneity is present because the CDMFT violates the trans- 
lational symmetry of the original lattice. In the 4x4 clus- 
ter, for example, the cluster sites, numbered in the right- 
most panel in Fig. 1(a), are categorized into three dif- 
ferent symmetry groups: {1,4, 13, 16}, {6, 7, 10, 11}, and 
{2, 3, 5, 8, 9, 12, 14, 15}. We pick up one site from each group 
and plot in Figs. 2(a)-(c) the absolute value of the local cluster 
quantities, M^-, S^-, and Gf^, against the Matsubara frequency 
for a hole-doped (n = 0.95) Mott insulator. The cluster quan- 
tities indeed depend on the group: While the dependence is 
small at high energy, it is more pronounced at low energy. 

An important finding in Fig. 2 is that the inhomogeneity 



occurs differently among M^, E^, and G^. To quantify it, we 
define a normalized deviation of the cluster quantities at each 
real- space vector r by 

where the average is defined by 

^'w^^J;: E (13) 

i,jGC;ri— rj=r 

and Nr is the number of pairs (i, j) satisfying the conditions 
i, j G C and — vj = r. Plotting the local (r = 0) component 
for each of M^, E^, and G^ in Fig. 2(d), we find that the nor- 
malized deviation is the smallest in M^, in particular, in the 
wide range of the relevant low-frequency region. This means 
that the cumulant is most homogeneous within the cluster. 
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FIG. 5. (Color online). Averaged cluster quantities for A^c = 16 
and n = 0.95 plotted against the Matsubara frequency, (a) The clus- 
ter cumulant, (b) self-energy, and (c) Green's function, (d) In |(5^| 
at the lowest Matsubara frequency ujq plotted against the Euclidean 
distance Irl. 



short distances. This behavior suggests that we are looking at 
the thermodynamic behavior already by the 4x4 cluster at the 
present temperature. 

When the temperature is reduced below 0.06t, the power- 
law decay expected to continue at longer distances in met- 
als is, of course, not properly captured by the 4x4 cluster. 
Since the energy resolution is set by the lowest Matsubara fre- 
quency ttT ~ 0.2t, the result suggests that the 4x4 cluster 
well describes the thermodynamic behavior within this en- 
ergy resolution (or at T > 0.06t). For instance, since the real 
and imaginary parts of Green's function have the denominator 
~ l/[(7rT)^ + (vf^)^], the correlation length of the exponen- 
tial decay in distance is ^ ~ ^f/ttT, where is the renor- 
malized Fermi velocity. In the underdoped region, the renor- 
malization factor is suppressed and may easily be of the or- 
der of 0.1, which is consistent with the behavior in Fig. 5(d). 
Because of the proximity to the Mott insulator, the correlation 
length (or coherence length) of Green's function is suppressed 
comparable to the lattice constant even at fairly low tempera- 
tures like 0.06t. Nevertheless, the characteristics such as the 
pseudogap, momentum differentiations and the Fermi arc for- 
mation are well developed in this temperature range. 



Figures 3(a)-(c) and 4(a)-(c) plot the nearest-neighbor and 
next-nearest neighbor components, respectively. These are 
the main sources of momentum dependence in the periodized 
quantities. We again see the inhomogeneity occurring in dif- 
ferent ways in M^, S^, and G^, and that the corresponding 
normalized deviations, shown in Figs. 3(d) and 4(d), are the 
smallest in at low energy. Note that a relatively large fluc- 
tuation in the normalized deviations at high energy is due to 
the small absolute values of the cluster quantities. 

The above results indicate that the inhomogeneity plays 
the weakest role for the M periodization scheme in the hole- 
doped Mott insulator. 



B. Locality of cluster quantities 

We next discuss how the cluster quantities decay with the 
real-space distance. Figures 5(a)-(c) plot the average quan- 
tities, M^, S^, and (5^, defined in Eq. (13) at various r 
against the Matsubara frequency. We see that (0,0), (1,0), 
and (1,1) components, which are within the 2x2 cluster, are 
much larger than the other components for all kinds of cluster 
quantities. In particular in and (5^, even the (1,1) compo- 
nent, the smallest one in the 2x2 cluster, is always more than 
two times larger than the longer-range components. This im- 
plies that the longer-range terms play a less important role in 
the periodization procedure, above all within the M scheme. 

In general. Green's function at a finite temperature should 
decay exponentially with |r| at long distances, and cross over 
to a power-law decay at short distances in a metallic state. To 
explore the decay, we plot in Fig. 5(d) In |(5^| at the lowest 
Matsubara frequency ujq against the Euclidean distance |r|. 
Although the data points fluctuate depending on direction, the 
decay seems to approximately follow an exponential decay at 
large distances and to cross over to a more moderate slope at 



C. Momentum dependence of periodized quantities 

We now turn to the quantities periodized through Eqs. (6) 
and (10) (i.e., the physical observables in the full momentum 
space). While in the limit of large cluster size all the peri- 
odization schemes should give the same result, it depends on 
the schemes how fast the results converge to the thermody- 
namic limit with increasing cluster size. Since the tractable 
cluster size is rather small at low temperatures, it is impor- 
tant to find out the most efficient periodization scheme in the 
relevant parameter region. In this section we compare the self- 
energies calculated by various periodization schemes for the 
2x2, 4x3, and 4x4 clusters, focusing on the slightly hole- 
doped region. We present the results for tilted clusters of 
A^c = 8 and 12 in Appendix B. Since Figs. 5(a)-(c) indicate 
that the momentum dependence becomes most significant at 
low energy, we focus on the self-energies at the lowest Mat- 
subara frequency ujq = ttT. 

Figure 6 compares the M, S, and G periodizations at 
n = 0.95. We first notice that all the results show a com- 
mon feature that ImE is small around (0, 0) and large around 
(7r,0) and (7r,7r). Looking at the convergence against the 
cluster size, we notice that the M-periodized self-energy fluc- 
tuates much less than the S- and G-periodized ones. This 
indicates a smaller size effect in the M periodization, con- 
sistently with the results discussed in Sees. Ill A and IIIB. 
Roughly speaking, the fluctuation is large around (tt, 0) and 
(7r,7r) while smaU around (0,0) and (|, |), reflecting the 
momentum-dependent amplitude of the self-energy. It is in- 
teresting that for the M periodization 4x3 and 4x4 cluster 
results agree well, in particular, around (0, 0) and (^, ^), in- 
dicating the self-energy is nearly converged there. It is also 
worthwhile noting that the 2 x 2 cluster result with the M pe- 
riodization already reproduces well the overall structure of the 




2x2 



A/c=8 



4x3 



4x4 



W 2 


Dc: 1 





Nc=^2, 16 

M — 

S — 
G — 


















FIG. 6. (Color online). Self-energies at the lowest Matsubara fre- 
quency, obtained by the M, S, and G periodizations for various 
square clusters at n = 0.95. 
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FIG. 7. (Color online). Self-energies obtained by the M periodiza- 
tion for various clusters at n = 0.97 and 0.99. 



4x3 and 4x4 cluster results. 

The bottom panels in Fig. 6 compare the 4 x 3 and 4x4 clus- 
ter results with the M, E, and G periodizations. We see a nice 
agreement around (0,0) and (f , f ), especially between the 
M and G schemes. The agreement between the different pe- 
riodization schemes corroborates our expectation for the con- 
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FIG. 8. (Color online), (a) Momentum maps of —l3G(k, f ) obtained 
through the M periodization for the chosen cluster sizes at n = 0.95. 
(b)-(c) Those of ImS(k, iojo) at n = 0.95, 0.97, and 0.99. Here the 
results with A^c = 8 are also presented for comparison. 



vergence. 

Figure 7 presents the M-periodized self-energy at n = 0.97 
and 0.99. The agreement between the 4x3 and 4x4 cluster 
results is still nice around (0, 0) and (f , f ) even at these small 
dopings. Also the 2x2 cluster result looks well reproducing 
these larger-cluster results. 

Figure 8(a) is the momentum map of the M-periodized 
Green's functions at r = | for the employed cluster sizes 
at n = 0.95. The quantity gives an estimate of the low-energy 
spectral weight because 



-/3G(k,T = /3/2) 



2 7-c 



A(k,w) 
cosh(;3w/2) 



duj 



(14) 



is approximately a spectral weight averaged over an energy 
width ~ T around = 0. Hence the momentum map in- 
dicates the shape of the Fermi surface. The results show 
the Fermi arc structure, found previously in 2x2 CDMFT 
studies,^"^'^^"^^, for all the cluster sizes. Remarkably the loca- 
tion of the arc does not change significantly with the cluster 
size. This is in agreement with the fast convergence of the 
self-energy around the nodal point seen in Fig. 6. Although 
we have not detected any indication of the Fermi pocket at 
this temperature (T = 0.06t), it is still consistent with the 
pocket structure at T = 0, as discussed in Refs. 1 1 and 12. 

Figures 8(b)-(d) are the momentum maps of the imaginary 
part of the M-periodized self-energy at n = 0.95, 0.97, and 
0.99. For all the cluster sizes and dopings ImE shows a strong 
intensity around the cut along ( ^ , tt) — (tt, ^ ) . The large | ImE | 
indicates the presence of a zero surface of Green's function at 
T = 0. The indicated location of the zero surface for the 
2x2 cluster is consistent with that previously found by 2 x 2 
CDMFT+ED studies,^' while it deviates from the one as- 
sumed in Ref. 29. It is remarkable that the location of the zero 
surface does not change significantly with the cluster size. It 
seems hardly moved with reducing doping from 5% to 1% as 
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well. This is quite unexpected from the viewpoint of the ex- 
tended Luttinger sum rule^^ which argues that the sum of the 
volume enclosed by the Fermi surface and by the zero sur- 
face is equal to the electron filling. If this rule holds, such 
a volume should be much smaller than the one indicated in 
Fig. 8 because the volume should approach to half of the full 
Brillouin zone as the filling approaches to the half filling. Fig- 
ure 8 indicates an anomalous metallic phase characterized by 
the simultaneous presence of both zero and Fermi surfaces. 
This phase is separated by a quantum phase transition from 
the Fermi liquid/^ which we find at high doping and which 
appears, within our numerical precision, to fully respect the 
Luttinger sum rule. 

There have been intensive debates on the applicability of 
the Luttinger sum rule in strongly correlated region. ^^'^^ Al- 
though the present results strongly suggest that the sum rule 
is violated in the underdoped region, the cluster-size depen- 
dence of E remaining around (tt, 0) and (tt, tt) makes difficult 
to make definitive statements. Future studies on larger clusters 
are highly desired to settle this issue. 

The present study indicates that many of the characteristic 
features of the doped Mott insulators summarized as the mo- 
mentum differentiation and identified in the pseudogap and 
Fermi arc formation are well captured even at the temperature 
scale of 0.06t or above. In this range of temperature, the ap- 
propriate M periodization scheme allows the convergence to 
the thermodynamic limit at a relatively small cluster size. If 
one wishes to see the growth of the electron coherence with 
higher energy resolution close to the Fermi level, one needs 
to go to lower temperatures together with the corresponding 
larger cluster size. Increasing the resolution in the momen- 
tum space simultaneously at lower temperatures beyond the 
present study is a challenge left for future studies. 



IV. SUMMARY AND CONCLUSION 

We have extended the cellular DMFT to clusters larger than 
the conventionally used 2x2 one, and systematically studied 
the cluster- size dependence of various quantities. While the 
CDMFT sacrifices the translational symmetry of the original 
lattice, it can provide through a periodization a fine structure 
in the momentum space. This is an interesting information, 
complementary to DC A results, where the self-energy is as- 
sumed to be flat in each momentum patch. Our strategy is to 
find and use an efficient quantity for the periodization, to be 
able to extract the thermodynamic behavior from a relatively 



small cluster calculation. 

In order to achieve this task, we have explored how ho- 
mogeneous and local the various cluster quantities are. Fo- 
cusing on the parameter region of hole-doped Mott insula- 
tors, we have found that the cluster cumulant is the most lo- 
cal and homogeneous quantity, favorable for the periodiza- 
tion. The comparison of the self-energies obtained by vari- 
ous periodization schemes shows that the fastest convergence 
against cluster size is obtained by the periodization of the cu- 
mulant M. The convergence depends on momentum: While it 
seems converged already at the 4x4 cluster around (0, 0) and 
^), a distinct size dependence still remains around (tt, 0) 
and (7r,7r). We have also found that the 2x2 cluster with 
the M periodization remarkably well reproduces the overall 
structure of the self-energy obtained with the 4x3 and 4x4 
clusters. The Fermi arc structure and the location of the low- 
energy zero surface, calculated through the M periodization, 
seem only weakly dependent on the cluster size, corroborating 
the picture of the Mott physics obtained by previous CDMFT 
studies. This result would imply a violation of the extended 
Luttinger theorem at small doping. 
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APPENDIX A: HIGH-FREQUENCY EXPANSION AND 
RELATION BETWEEN M, S, AND G PERIODIZATIONS 

We first give a simple argument why the correct imple- 
mentation of the G periodization scheme is given by Eq. (10) 
rather than by Eq. (6) with = G^. 



The two Green's functions (5(k, z) and G^{z) defined in Eqs. (5) and (4), respectively, can be expanded for large values of 

Z = iuOn + 11'. 

G(k, z) = [zi - £(k) - ~^ = -J^-^ {K^) + ^''(^)) + • • • . (15) 



Nc 
(2^ 



dkG(k, z) 



RBZ 



Nc 



[ dk( 

Jrbz ^ 



zi-i{k) - E^(^) 



J ' 

JRBZ 



dki{k) + ±^{z) 



(16) 
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Let us note that, for the present analysis of the different periodization schemes, the expHcit Xj z expansion of ^^{z) is not 
necessary. 

Written in this form, it is simple to see what happens if one periodizes G(k, z) and &~'{z), that is, if one applies Eqs. (10) and 
(6): 



GL(k,z) 



lfromG(k,2) js^ 



Gij(k,2;) 



1 



1 1 



(17) 



and 
GL(k,z) 



I from 0^(2) 



1 y g«k.(r.-r,)^C (^)^ 1 



1 1 



^ gik.(r,-r,) 



(2^)2 



dk t,,(k) + I]^^.(^) 



Using the definition given just below Eq. (5), we can re-express the term involving f(k) in Eq. (17) as 



1 



and that in Eq. (18) as 



tij{k) = ^Y1 e'^-^'^^-'^^-)^e-^(*^+^>("^-'^^)e(k + K) = e(k), 



Nc 



ijec 



K 



ik.(r,-r,) 



Nc 



(18) 



(19) 



(20) 



ijec 



where denotes the hopping matrix restricted to the cluster. The left-hand side of Eq. (20) is different from the full lattice 
dispersion e(k) that one would like to be reproduced in the lattice Green's function at the second order in the 1/z expansion. Such 
a simple argument thus shows that the periodization of the cluster Green's function matrix [i.e., Eq. (18)] is not an appropriate 
periodization scheme. 

I 

deviate from it. This is indeed observed in our CDMFT solu- 
tion for doped Mott insulators, where the self-energy is large 
(see bottom panel in Fig. 6). 

These considerations can be also clarified by expanding the 
G-periodized Green's function [Eq. (10)] with respect to the 
cumulant (if the self-energy is large compared to the bare hop- 
ping terms, M^i is small): 

G(k, iujn) - M'^{ic^n)i{^)\ ^ M^{i^n) 



Conversely one can easily show that at the second order 
in the 1/z expansion the M- and E-periodized Green's func- 
tions yield the same result as Eq. (17), and in particular they 
reproduce the right band dispersion. The G-, M-, and E- 
periodization schemes differ instead already at the third order 
in In the G periodization the third order term in the 1/z 
expansion reads 



1 1 



gik.(r,-r,) 

ijec 

while in the M periodization it is 



(21) 



(24) 



1 



ijec 

and in the E periodization it is 



1 



6(k). 



Nc 



1 ^ 



ijec 



(22) 



(23) 



Hence, ^^(k, iujn 



Ifrom G{\<.^ioJn) 



M^(k, iujn) at the first or- 



der. 

On the other hand, in the M periodization, the lattice 
Green's function is given by Eq. (8), so that 



ML(k, iuJn) 



GL(k,ia;0 Ifrom MC(,,^) - i_M^{k,iUn)e{k) 
M^ik, iujn) + M^{k, iujn)e{k)M^{k, iujn) + ' ' • (25) 



at 



We notice that the M and G periodizations share the same 
term proportional to (E^)^ (while M and E periodizations 
share the term ekEk). Therefore, in a strongly correlated 
regime where the self-energy is large compared to the bare 
dispersion, we may expect that the M and G schemes produce 
a similar G^, while the E-periodized Green's function should 



Thus GL(k,ZCJn) Ifrom (^(k,ic..) = (^^ ^^n) Ifrom MC 

the first order in M^t, and we expect that the M and G 
periodizations produce similar results close to the insulating 
state, even at energy scales lower than the range of valid- 
ity of the 1/z expansion. We stress that the lattice Green's 
function obtained with the E periodization gives a result a 
priori very different. This is because the periodization of 
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M periodization 

-1 1 1 \ 1 .4I \ 1 

(0,0) (71,0) {n,n) (0,0) (0,0) (7:,0) {n,n) (0,0) 



FIG. 9. (Color online). Self-energies obtained by M periodization at 
n = 0.95 for tilted clusters of A^c = 8 and 12. For comparison the 
result for the 4x 3 cluster is also plotted. 

fl^{iujn) = i^n -\- /J^ — M^{iuJn)~^ corresponds to the peri- 
odization of M^{iujn)~^ , which gives a lattice cumulant very 
different from the one obtained with the direct periodization 
of the cluster cumulant M^{iujn)- 



APPENDIX B: EFFECT OF CLUSTER GEOMETRY 

In general we still have several options in the cluster shape 
even when the cluster size is fixed. The cluster shape signif- 
icantly affects the inhomogeneity of cluster quantities since 
each cluster site is differently connected to the other clus- 



ter sites and/or bath sites. For example, "surface" sites on 
the boundary of the cluster behave differently from the inner 
sites. Since nonlocal correlations will be more accurately in- 
corporated in the inner-site self-energy^"^ than in the surface- 
site one, a strategy to choose the cluster shape is to minimize 
the effect of the surface sites. 

This was already demonstrated in Figs. 2 and 3. When com- 
paring the results at the corner site 1 and at the other surface 
site 2, the former deviates more from those at the inner site 6 
than the latter. This reflects the fact that site 1 has only two 
nearest neighbors in the cluster, while site 2 has three neigh- 
bors. 

Here we compare the two different 12-site clusters shown 
in Fig. 1(a). An important difference is that the Nq = 12' 
cluster has two tip sites coupling to only one nearest neighbor 
while the 4x3 cluster has no such sites. The cluster cumulants 
associated with these tip sites indeed show a peculiar behavior 
largely deviated from the inner ones (not shown). 

The difference is reflected in the M-periodized self-energy 
plotted in Fig. 9, where we see a large deviation around (tt, 0) 
and (tt, tt) while a fairly nice agreement around (0, 0) and 
(^, ^). We see that the Nq = 12' result is rather closer to 
the Nc = 8 one, which also has two tip sites. 

The result indicates that we should avoid clusters contain- 
ing a tip site, in order to obtain a fast convergence to the ther- 
modynamic limit. A simple prescription for this is to take a 
square cluster whose edges are parallel to the lattice vectors, 
as we have done so far. 
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